THE VON NEUMANN INEQUALITY FOR 3 x 3 MATRICES

GREG KNESE 


1. Introduction 

The purpose of this note is to explain how recent results of Kosiński
[13] provide a proof of the von Neumann inequality for d-tuples of 3 x 3
commuting contractive matrices.

Definition 1.1. A d-tuple of pairwise commuting contractive matrices
or operators $T = (T_1, \dots, T_d)$ satisfies the von Neumann inequality if
for every $p \in \mathbb{C}[z_1, \dots, z_d]$,

$$\|p(T)\| \le \sup_{z \in \mathbb{T}^d} |p(z)|.$$

Recall that contractive means $\|T_j\| \le 1$ and $\mathbb{T}^d$ is the unit d-torus in
$\mathbb{C}^d$.
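As a numerical sanity check of Definition 1.1 in the simplest case d = 1, the following Python sketch evaluates a polynomial at a single 2 x 2 contraction and compares the operator norm with the supremum over the unit circle. The matrix, the coefficients, and all function names are illustrative assumptions and do not come from the references. The coefficients are chosen nonnegative so that the supremum over the circle is attained at the sample point z = 1.

```python
import cmath
import math


def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def poly_of_matrix(coeffs, t):
    """Horner evaluation of p(T) = sum_k coeffs[k] * T^k for a 2x2 matrix T."""
    result = [[0, 0], [0, 0]]
    for c in reversed(coeffs):
        result = matmul(result, t)
        result[0][0] += c
        result[1][1] += c
    return result


def op_norm(m):
    """Largest singular value of a 2x2 matrix, from its Frobenius norm and determinant."""
    (a, b), (c, d) = m
    frob2 = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2
    det2 = abs(a * d - b * c) ** 2
    return math.sqrt((frob2 + math.sqrt(max(frob2 ** 2 - 4 * det2, 0.0))) / 2)


def sup_on_circle(coeffs, samples=720):
    """Maximum of |p| over equally spaced points of the unit circle (a lower bound for the sup)."""
    return max(
        abs(sum(c * cmath.exp(2j * math.pi * s / samples) ** k
                for k, c in enumerate(coeffs)))
        for s in range(samples)
    )


# Hypothetical contraction and polynomial p(z) = 2 + z + 3 z^2.
t = [[0.5, 0.5], [0.0, -0.3j]]
coeffs = [2, 1, 3]
lhs = op_norm(poly_of_matrix(coeffs, t))   # ||p(T)||
rhs = sup_on_circle(coeffs)                # sup of |p| over the unit circle
```

Here `lhs <= rhs` is what von Neumann's original inequality predicts for this single contraction; the sketch only illustrates the statement and proves nothing.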

The von Neumann inequality holds for a single contractive operator
(von Neumann's original result [15]) as well as for a pair of commuting
contractions (a result of Ando [2]). For d > 2 there are known examples
of d-tuples of commuting contractions for which the von Neumann
inequality fails. Varopoulos [17] proved the existence of counterexamples
with a probabilistic argument and later Kaijser and Varopoulos (see
addendum to [17]) as well as Crabb and Davie [1] found explicit
counterexamples. These counterexamples were all given by finite matrices.
It turns out that the von Neumann inequality holds for d-tuples of 2 x 2 
commuting contractive matrices; this result is essentially equivalent to 
the Schwarz lemma on the polydisc. On the other hand, Holbrook 
[9] has found a 3-tuple of 4 x 4 matrices which fail the von Neumann 
inequality. 

A great deal of effort has been expended to put in the last piece
of this particular puzzle: Does the von Neumann inequality hold for
d-tuples of 3 x 3 commuting contractive matrices? For evidence of
interest in this question see pi^lTl]. 

Recent work of Kosiński proves that the answer is yes.

Theorem 1.2. The von Neumann inequality holds for d-tuples of 3 x 3
commuting contractive matrices.

This is also interesting in light of the fact that there exists a
4-tuple of 3 x 3 commuting contractions that does not dilate to commuting
unitary operators [3]. Possessing a unitary dilation is a much stronger
property than satisfying a von Neumann inequality; in fact, it means
that a von Neumann inequality holds for matrix-valued polynomials of
all matrix sizes. Nevertheless, all of the von Neumann inequalities cited
above have dilation counterparts: the Sz.-Nagy dilation theorem
vis-à-vis von Neumann's inequality, Ando's dilation theorem (Ando's actual
result in [2]), d-tuples of 2 x 2 commuting contractive matrices always
dilate [6i[in]. The situation of a von Neumann inequality without a 
dilation theorem is not uncommon though, and it is closely related to 
the distinction between spectral sets and complete spectral sets. See 
[5] for more information. 

In the next two sections, we point out some known reductions and
then explain how Kosiński's work proves Theorem 1.2. In the third
and final section, we explain some other operator theory related
consequences of [13]. Namely, solvable three-point Pick interpolation
problems on the polydisc can always be solved with a rational inner function
in the Schur-Agler class.


2. Reductions 

Let $T = (T_1, \dots, T_d)$ be a d-tuple of 3 x 3 commuting contractive
matrices. To start with, we can assume the matrices are strict
contractions ($\|T_j\| < 1$, $j = 1, \dots, d$) and then the theorem will follow by
continuity. Next, a d-tuple of 3 x 3 commuting matrices can always
be perturbed to a simultaneously diagonalizable d-tuple of commuting
matrices. The reference m points out several places where this is 
proven; see [71[8]llT]. 

After adjusting our operators to be nicer, we will replace polynomials
with functions that are less nice. If T is now a d-tuple of commuting
simultaneously diagonalizable strictly contractive 3 x 3 matrices, then
it suffices to prove

$$\|f(T)\| \le 1$$

for all $f : \mathbb{D}^d \to \mathbb{D}$ holomorphic on the unit d-dimensional polydisc.
This follows from the fact that such holomorphic functions can be
approximated locally uniformly on $\mathbb{D}^d$ by polynomials $p \in \mathbb{C}[z_1, \dots, z_d]$
with supremum norm at most 1 on $\mathbb{D}^d$ (or on $\mathbb{T}^d$, by the maximum
principle). See Rudin [16].

Using bounded holomorphic functions makes it possible to apply
Möbius transformations to the matrices $T_1, \dots, T_d$ in order to force one
of the joint eigenvalues of T to be $0 \in \mathbb{C}^d$, while still maintaining all
other properties of T. We can also apply a Möbius transformation to
$f : \mathbb{D}^d \to \mathbb{D}$ and assume $f(0) = 0$.
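The recentering step can be sketched at the level of joint eigenvalues. The sketch below assumes the standard disc automorphism $\varphi_a(\zeta) = (a - \zeta)/(1 - \bar{a}\zeta)$, applied coordinatewise; the nodes and function names are hypothetical. For a simultaneously diagonalizable tuple, the joint eigenvalues of $(\varphi_{a_1}(T_1), \dots, \varphi_{a_d}(T_d))$ are the images of the joint eigenvalues of T, so it is enough to track the nodes.

```python
def mobius(a, zeta):
    """Disc automorphism phi_a(zeta) = (a - zeta) / (1 - conj(a) * zeta); it swaps a and 0."""
    a = complex(a)
    return (a - zeta) / (1 - a.conjugate() * zeta)


def recenter(nodes, base):
    """Apply phi_{base[j]} in coordinate j to every node, which sends `base` to the origin."""
    return [tuple(mobius(b, x) for b, x in zip(base, node)) for node in nodes]


# Hypothetical joint eigenvalues of a commuting triple (d = 3), all inside the polydisc.
nodes = [(0.3 + 0.1j, -0.2j, 0.5), (0.1, 0.4, -0.3j), (-0.6, 0.2 + 0.2j, 0.1)]
moved = recenter(nodes, nodes[0])
```

After the call, `moved[0]` is the origin and the other two nodes remain in the open polydisc, which matches the normalization used in the text.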

Let $0, z, w \in \mathbb{D}^d$ be the joint eigenvalues of T with corresponding
eigenvectors $e, u, v \in \mathbb{C}^3$. Then, $f(T)$ is the 3 x 3 matrix with
eigenvalues $0, \sigma = f(z), \tau = f(w) \in \mathbb{D}$ and eigenvectors e, u, v. It now becomes
of interest to understand all holomorphic functions $g : \mathbb{D}^d \to \mathbb{D}$ which
solve the following interpolation problem:

$$\begin{aligned} 0 &\mapsto 0 \\ z &\mapsto \sigma \\ w &\mapsto \tau \end{aligned} \tag{2.1}$$

Theorem 1.2 will follow from the next result which is explained in
the next section.

Theorem 2.1 (Kosiński). If the interpolation problem (2.1) can be
solved with $g : \mathbb{D}^d \to \mathbb{D}$ holomorphic, then there exist holomorphic
$F_1, F_2 : \mathbb{D}^2 \to \mathbb{D}$ such that (2.1) can be solved with a function of the
form

$$F(z) = F_1(F_2(z_1, z_2), z_3) \tag{2.2}$$

after possibly permuting the variables.

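Indeed, by Ando's inequality $S = F_2(T_1, T_2)$ is a contraction
commuting with $T_3$ and therefore $F(T) = F_1(S, T_3)$ is a contraction. Since
$F$ and $f$ take the same values at the joint eigenvalues $0, z, w$ of the
simultaneously diagonalizable tuple T, this contraction is equal to $f(T)$
as above. This proves Theorem 1.2 given Theorem 2.1.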

3. The three point Pick problem on the polydisc 

Theorem 2.1 follows from work in [13] after making some further
reductions to put us in the most interesting situation (that of extremal
and non-degenerate interpolation problems).

First, it is useful to perturb the nodes $0, z = (z_1, \dots, z_d), w = (w_1, \dots, w_d)$
into a more generic position. Let

$$\rho(a, b) = \left| \frac{a - b}{1 - \bar{b} a} \right|$$

be the pseudo-hyperbolic distance on the unit disk. Perturb z, w so that all of the
quantities below are distinct, yet T is still strictly contractive:

$$|z_1|, \dots, |z_d|, \quad |w_1|, \dots, |w_d|, \quad \rho(z_1, w_1), \dots, \rho(z_d, w_d).$$
We will see momentarily why this is useful. Functions as in (2.2) form
a normal family so we can approximate the less generic interpolation
problems by the generic ones.
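The genericity requirement is easy to test numerically. The following Python sketch computes the pseudo-hyperbolic distance and checks that the 3d quantities listed above are pairwise distinct. The function names, the sample nodes, and the tolerance `tol` (used in place of exact distinctness to guard against rounding) are assumptions made for illustration.

```python
from itertools import combinations


def rho(a, b):
    """Pseudo-hyperbolic distance |(a - b) / (1 - conj(b) * a)| on the unit disk."""
    return abs((a - b) / (1 - complex(b).conjugate() * a))


def genericity_quantities(z, w):
    """The list |z_1|,...,|z_d|, |w_1|,...,|w_d|, rho(z_1,w_1),...,rho(z_d,w_d)."""
    return ([abs(x) for x in z]
            + [abs(x) for x in w]
            + [rho(a, b) for a, b in zip(z, w)])


def is_generic(z, w, tol=1e-9):
    """True when all the quantities differ pairwise by more than tol."""
    values = genericity_quantities(z, w)
    return all(abs(p - q) > tol for p, q in combinations(values, 2))


# Hypothetical nodes in the tridisc (d = 3).
generic = is_generic((0.1, 0.2, 0.3), (0.4, 0.5, 0.6))
not_generic = is_generic((0.5, 0.5j, 0.3), (0.1, 0.2, 0.4))  # |z_1| = |z_2|
```

In the second call two moduli coincide, so a small perturbation of the nodes would be needed before applying the argument in the text.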

The interpolation problem (2.1) is said to be extremal if it cannot be
solved with a holomorphic function g satisfying $\sup_{\mathbb{D}^d} |g| < 1$. There is
no harm in multiplying $\sigma, \tau$ by $r > 1$ if necessary to force the problem
to be extremal.

The interpolation problem (2.1) is said to be non-degenerate if no
two-point subproblem is extremal. If a two-point subproblem is extremal
(the degenerate case) then one of the following holds:

$$|\sigma| = \max_{j=1,\dots,d} |z_j| \quad\text{or}\quad |\tau| = \max_{j=1,\dots,d} |w_j| \quad\text{or}\quad \rho(\sigma, \tau) = \max_{j=1,\dots,d} \rho(z_j, w_j).$$

By our genericity assumption, whichever maximum occurs above will
occur at a unique j and this forces the solution function to be unique
and to depend on one variable. We provide some details in Lemma 3.1
at the end of this section.
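The three equalities characterizing the degenerate case can be checked directly. The sketch below tests each of them and reports the index j at which the maximum is attained. The function names, the labels for the subproblems, the tolerance, and the sample data are hypothetical; the code only tests the displayed equalities and does not decide solvability of the three-point problem.

```python
def rho(a, b):
    """Pseudo-hyperbolic distance on the unit disk."""
    return abs((a - b) / (1 - complex(b).conjugate() * a))


def extremal_subproblems(z, w, sigma, tau, tol=1e-12):
    """Map each two-point subproblem satisfying its equality to the (0-based) maximizing index j."""
    checks = {
        "0,z": (abs(sigma), [abs(x) for x in z]),
        "0,w": (abs(tau), [abs(x) for x in w]),
        "z,w": (rho(sigma, tau), [rho(a, b) for a, b in zip(z, w)]),
    }
    found = {}
    for name, (target, values) in checks.items():
        best = max(values)
        if abs(target - best) <= tol:
            found[name] = values.index(best)
    return found


# Hypothetical nodes; the targets below are the values of f(z) = z_1 at z and w.
z = (0.5, 0.2, 0.1)
w = (0.1, 0.3, 0.4)
degenerate = extremal_subproblems(z, w, sigma=0.5, tau=0.1)
non_degenerate = extremal_subproblems(z, w, sigma=0.2, tau=0.05)
```

For the first set of targets two of the equalities hold, both at the first coordinate, consistent with a solution depending on one variable only.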

Thus, in the degenerate case we can certainly solve with a function
of the form (2.2) and we may now assume our interpolation problem is
non-degenerate.

To finish, we may quote appropriate results from [13]. Lemma 3 of
[13] states that for d = 3, if the interpolation problem (2.1) is extremal,
non-degenerate, and strictly 3-dimensional, then it can be solved with
a function of the form (2.2). Strictly 3-dimensional means the problem
can be solved with a function depending on 3 variables but not with
a function only depending on 2 variables. We may certainly assume
that we are in the strictly 3-dimensional case since otherwise there is
nothing to prove.

For d > 3, Lemma 5 of [13] states that if (2.1) is extremal and
non-degenerate, then after permuting variables if necessary the problem
can be solved with a function of the form (2.2). This completes our
explanation of Kosiński's Theorem 2.1.

We used the following fact above. 


Lemma 3.1. Suppose $f : \mathbb{D}^d \to \mathbb{D}$ is holomorphic, $f(0) = 0$ and there
exists $w \in \mathbb{D}^d$ such that $f(w) = w_1$ and $|w_1| > |w_j|$ for all $j \ne 1$. Then,
$f(z) = z_1$ for all $z \in \mathbb{D}^d$.

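Proof. For $\zeta \in \mathbb{D}$, let $h(\zeta) = f\left(\frac{\zeta}{w_1} w\right)$. Then, $h(0) = 0$, $h(w_1) = w_1$ and
therefore by the classical Schwarz lemma, $h(\zeta) = \zeta$. This implies

$$\sum_{j=1}^{d} \frac{\partial f}{\partial z_j}(0) \frac{w_j}{w_1} = 1.$$

By [16] (page 179),

$$\sum_{j=1}^{d} \left| \frac{\partial f}{\partial z_j}(0) \right| \le 1.$$

Since $|w_1| > |w_j|$ for $j \ne 1$, this can only happen if $\frac{\partial f}{\partial z_1}(0) = 1$.
This implies $f(\zeta, 0, \dots, 0) = \zeta$. By
Lemma 3.2 of [12], this implies $f(z) = z_1$. □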


4. The Schur-Agler class 

The Schur-Agler class on the polydisc $\mathbb{D}^d$ consists of holomorphic
functions $f : \mathbb{D}^d \to \mathbb{D}$ such that

$$\|f(T)\| \le 1$$

for all d-tuples T of commuting strictly contractive operators.
Functions of the form (2.2) are certainly in the Schur-Agler class; indeed
the argument after the statement of Theorem 2.1 proves this.

Thus, three-point Pick interpolation problems on the polydisc can 
be solved with functions in the Schur-Agler class. However, the Schur- 
Agler class has a well-known interpolation theorem due to Agler; see 
PP. Combining these observations we get the following. 

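Theorem 4.1. Given $z_1, z_2, z_3 \in \mathbb{D}^d$ and $t_1, t_2, t_3 \in \mathbb{D}$ there exists a
holomorphic function $f : \mathbb{D}^d \to \mathbb{D}$ satisfying $f(z_j) = t_j$ for j = 1, 2, 3 if
and only if there exist positive semi-definite 3 x 3 matrices $\Gamma^1, \Gamma^2, \dots, \Gamma^d$
such that

$$1 - t_j \bar{t}_k = \sum_{n=1}^{d} \left(1 - z_j^n \bar{z}_k^n\right) \Gamma^n_{j,k}$$

for j, k = 1, 2, 3. Here superscripts are used to denote components of
$z_j \in \mathbb{D}^d$.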
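A candidate decomposition in Theorem 4.1 can be verified entry by entry. The Python sketch below, in which the positive semi-definite matrices of the theorem are written Γ^1, ..., Γ^d, measures how far a given family is from satisfying the identity $1 - t_j \bar{t}_k = \sum_n (1 - z_j^n \bar{z}_k^n) \Gamma^n_{j,k}$. The function name and the data are hypothetical. The example uses the coordinate function f(z) = z_1 with d = 2, for which Γ^1 can be taken to be the all-ones matrix and Γ^2 the zero matrix, both of which are positive semi-definite. Positive semi-definiteness itself is not checked by the code.

```python
def agler_residual(nodes, targets, gammas):
    """Largest |1 - t_j conj(t_k) - sum_n (1 - z_j^n conj(z_k^n)) Gamma^n_{jk}| over all j, k."""
    worst = 0.0
    for j, (zj, tj) in enumerate(zip(nodes, targets)):
        for k, (zk, tk) in enumerate(zip(nodes, targets)):
            lhs = 1 - tj * complex(tk).conjugate()
            rhs = sum(
                (1 - zj[n] * complex(zk[n]).conjugate()) * gammas[n][j][k]
                for n in range(len(gammas))
            )
            worst = max(worst, abs(lhs - rhs))
    return worst


# Hypothetical nodes in the bidisc (d = 2) with targets given by f(z) = z_1.
nodes = [(0.0, 0.0), (0.5, 0.2j), (-0.3, 0.4)]
targets = [node[0] for node in nodes]
ones = [[1.0] * 3 for _ in range(3)]
zeros = [[0.0] * 3 for _ in range(3)]
gammas = [ones, zeros]
residual = agler_residual(nodes, targets, gammas)
```

A residual of zero (up to rounding) confirms the identity for this particular choice. Finding suitable matrices for general data is a semidefinite feasibility problem, which this sketch does not attempt.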

This result can be used to prove that every solvable 3-point Pick 
problem can be solved with a rational inner function in the Schur-Agler 
class. The paper [13] already proves this in the case of non-degenerate 
extremal problems but then the above general machinery can be used 
to show that every solvable problem, whether extremal or not, has a 
rational inner solution. 